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GenCore version 6.3 
Copyright (c) 1993 - 2009 Bioccelerat ion Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



April 28, 2009, 04:16:32 ; Search time 908 Seconds 
(without alignments) 
98090.438 Million cell updates/sec 

US-10-529-592A-1 

881 

1 gggccatgacccccgctgct aaataaagatcctctgtaac 881 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 14594359 seqs, 50548307366 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



GenEmbl : * 
gb_env : * 
gb_pat : * 
gb_phg : * 
gb_pln : * 
gb_pri : * 
gb_rod : * 
gb_sts : * 
gb_syn : * 
gb_una : * 



http://es/ScoreAccessWeb/G«tItem.action?AppId=1052959...27_122913_us-10-529-592a-l.rge&ItemType=4&startByte=0(l of 35)5/19/2009 9:51:03 AM 



SCORE Search Results Details for Application 10529592 and Search Result 20090427_122913_us-10-529-592a-l. 



10 


gb_ 


_vr 1 : * 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



CQ800176 881 bp DNA 

Sequence 1 from Patent WO2004031411 . 
CQ800176 

CQ800176.1 GI:46849094 



PAT 28-APR-2004 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Nakamura,Y. and Katagiri,T. 

TITLE Genes and polypeptides relating to human pancreatic cancers 

JOURNAL Patent: WO 2004031411-A 1 15-APR-2004; 

Oncotherapy Science, Inc. (JP) ; Japan as represented by The 
President of the University of Tokyo (JP) 
Location/ Qualifiers 
1. .881 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 
163. .393 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="CAG27584. 1" 
/db_xref ="GI : 46849095" 

/ translat ion="MGLKMSCLKGFQMCVSSSSSSHDEAPVLNDKHLDVPDI I ITPPT 
PTGMMLPRDLGSTVWLDETGSCPDDGEIDPEA" 



Query Match 



Score 881; DB 2; Length 
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Best Local Similarity 100.0%; Pred. No. 7.1e-254; 

Matches 8 81; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

Qy 61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

4 81 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 54 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 81 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 54 0 

5 41 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5 41 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 0 0 

6 01 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 01 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 

661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 72 0 
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721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

7 81 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 4 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 81 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 4 0 

8 41 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 

I I I I I I I I I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I I 
8 41 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 



CS502806 881 bp DNA 

Sequence 1 from Patent WO2007013358 . 
CS502806 

CS502806.1 GI:138982984 



PAT 02-APR-2007 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

1 

Nakamura,Y., Katagiri,T. and Inaki,K. 

Vivit polypeptides, therapeutic agent comprising the same, and 
method of screening for anti-cancer agent 
Patent: WO 2007013358-A 1 01-FEB-2007; 

The University of Tokyo (JP) ; Oncotherapy Science, Inc. (JP) 
Location /Qualifiers 
1. .881 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref ="taxon : 9606 " 
163. .393 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="CAM83960 . 1" 
/db_xref ="GI : 138982985" 

/ translat ion="MGLKMSCLKGFQMCVSSSSSSHDEAPVLNDKHLDVPDI I ITPPT 
PTGMMLPRDLGSTVWLDETGSCPDDGEIDPEA" 



Query Match 100.0%; Score 881; DB 2; Length 881; 

Best Local Similarity 100.0%; Pred. No. 7.1e-254; 

Matches 881; Conservative 0; Mismatches 0; Indels 0; Gaps 



1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 
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I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

6 1 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
6 1 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 54 0 

5 41 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5 41 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 0 0 

6 01 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 01 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 

661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 72 0 

721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 
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7 81 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 4 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 81 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 4 0 

8 41 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 

I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I 
8 41 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



DD295744 881 bp DNA linear PAT 16-AUG-2006 

GENES AND POLYPEPTIDES RELATING TO HUMAN PANCREATIC CANCERS. 
DD295744 

DD295744.1 GI:112721854 
JP 2006500947-A/l . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 881) 
Nakamura,Y. and Katagiri,T. 

GENES AND POLYPEPTIDES RELATING TO HUMAN PANCREATIC CANCERS 
Patent: JP 2006500947-A 1 12-JAN-2006; 

ONCOTHERAPY SCIENCE INC, JAPAN AS REPRESENTED BY THE PRESIDENT OF 
THE UNIVERSITY OF TOKYO 



OS 
PN 



PF 
PR 



Homo sapiens 
JP 2006500947-A/l 
12-JAN-2006 

12-SEP-2003 JP 2004541224 

30-SEP-2002 US 6 0 / 4 1 48 72 , 2 8-FEB-2 0 03 US 
yusuke nakamura, toyomasa katagiri 
CC 

FH Key Location/Qualifiers 
FT CDS (163) . . (390) . 

Location /Qualifiers 

1. .881 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 



60/450889 PI 



Query Match 100.0%; Score 881; DB 2; Length 881; 

Best Local Similarity 100.0%; Pred. No. 7.1e-254; 

Matches 881; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 
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6 1 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
6 1 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 54 0 

5 41 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5 41 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 0 0 

6 01 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 01 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 

661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 72 0 

721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

7 81 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 4 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 781 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 840 



3 41 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 

I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I 
3 41 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 



CQ800178 8 93 bp DNA 

Sequence 3 from Patent WO2004031411 . 
CQ800178 

CQ800178.1 GI:46849096 



PAT 28-APR-2004 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Nakamura,Y. and Katagiri,T. 

TITLE Genes and polypeptides relating to human pancreatic cancers 

JOURNAL Patent: WO 2004031411-A 3 15-APR-2004; 

Oncotherapy Science, Inc. (JP) ; Japan as represented by The 
President of the University of Tokyo (JP) 
Location /Qualifiers 
1. .893 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 
197. .259 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="CAG27585 . 1" 
/db_xref="GI : 46849097" 
/translation^' MGLKMS CLKAAAAAT TRPP S " 



Query Match 93.9%; Score 82 7; DB 2; Length 8 93; 

Best Local Similarity 97.5%; Pred. No. 1.4e-237; 

Matches 859; Conservative 0; Mismatches 0; Indels 22; Gaps 1; 

Qy 1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 5 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 9 4 



Qy 61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 95 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 154 
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Qy 121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 155 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 214 

181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

215 TGCCTGAA AGCAGCAGCAGCAGCCACGACGAGGCCCCC 252 

2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 53 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 312 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 
I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

313 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 3 72 

361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 73 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 43 2 

421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
433 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 49 2 

481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 54 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 93 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 552 

5 41 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

553 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 612 

6 01 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

613 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 6 72 

661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 72 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 73 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 73 2 

721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
733 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 79 2 

7 81 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 4 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 93 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 852 

8 41 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 
I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I 

853 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 893 
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CS502808 893 bp DNA 

Sequence 3 from Patent WO2007013358 . 

CS502808 

CS502808.1 GI:138982991 



linear PAT 02-APR-2007 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 

Nakamura,Y., Katagiri,T. and Inaki,K. 

Vivit polypeptides, therapeutic agent comprising the same, and 
method of screening for anti-cancer agent 
Patent: WO 2007013358-A 3 01-FEB-2007; 

The University of Tokyo (JP) ; Oncotherapy Science, Inc. (JP) 
Location /Qualifiers 
1. .893 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 
197. .259 

/note="unnamed protein product" 

/codon_start=l 

/protein_id="CAM83961 .1" 

/db_xref ="GI : 138982992" 

/ translat ion="MGLKMSCLKAAAAATTRPPS " 



Query Match 93.9%; 
Best Local Similarity 97.5%; 
Matches 859; Conservative 



Score 82 7; DB 2; 
Pred. No. 1.4e-237; 
0; Mismatches 0; 



Length 8 93; 



Indels 



22; 



1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 
I I I I I I I I I I I I I I I I I I I I I I I I II i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 5 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 9 4 

6 1 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

9 5 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 15 4 

121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

155 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 214 



181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 
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I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

215 TGCCTGAA AGCAGCAGCAGCAGCCACGACGAGGCCCCC 252 

2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 53 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 312 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
313 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 3 72 

361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 
I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 73 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 43 2 

421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
433 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 49 2 

481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 54 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 93 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 552 

5 41 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 60 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
553 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 612 

6 01 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
613 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 6 72 

661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 72 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 73 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 73 2 

721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
733 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 79 2 

7 81 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 93 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 852 

8 41 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 

I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I 
853 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 893 



RESULT 6 
DD295745 

LOCUS DD295745 893 bp DNA linear PAT 16-AUG-2006 
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DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



GENES AND POLYPEPTIDES RELATING TO HUMAN PANCREATIC CANCERS. 
DD295745 

DD295745.1 GI:112721855 
JP 2006500947-A/2 . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 893) 
Nakamura,Y. and Katagiri,T. 

GENES AND POLYPEPTIDES RELATING TO HUMAN PANCREATIC CANCERS 
Patent: JP 2006500947-A 2 12-JAN-2006; 

ONCOTHERAPY SCIENCE INC, JAPAN AS REPRESENTED BY THE PRESIDENT OF 

THE UNIVERSITY OF TOKYO 

OS Homo sapiens 

PN JP 2006500947-A/2 

PD 12-JAN-2006 

PF 12-SEP-2003 JP 2004541224 
PR 30-SEP-2002 US 60/414872,28 
yusuke nakamura, toyomasa katagiri 

CC 

FH Key Location/Qualifiers 
FT CDS (197). .(256). 

Location /Qualifiers 
1. .893 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref ="taxon : 96 06" 



-FEB-2003 US 



60/450889 PI 



Query Match 93.9%; 
Best Local Similarity 97.5%; 
Matches 859; Conservative 



Score 82 7; DB 2; Length 8 9 
Pred. No. 1.4e-237; 
0; Mismatches 0; Indels 



Gaps 



Qy 



1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3 5 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 9 4 



Qy 

Db 

Qy 
Db 
Qy 
Db 



6 1 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

9 5 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 15 4 

121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

155 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 214 

181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

215 TGCCTGAA AGCAGCAGCAGCAGCCACGACGAGGCCCCC 252 
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2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I! 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 53 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 312 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
313 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 3 72 

361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 73 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 43 2 

421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
433 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 49 2 

4 81 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 93 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 552 

5 41 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 0 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
553 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 612 

6 01 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
613 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 6 72 

661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 72 0 
I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 73 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 732 

721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
733 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 79 2 

7 81 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 93 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 852 

8 41 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
853 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 893 



RESULT 7 
DD131945/C 

LOCUS DD131945 614 bp DNA linear PAT 04-NOV-2005 

DEFINITION Diagnosis and Prognosis of Breast Cancer Patients. 
ACCESSION DD131945 
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VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



DD131945.1 GI:92825448 
JP 2005500832-A/2531 . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 614) 

Bernards, R., Vi j ver , M . J . V . D . , Veer,L.J.V., Robert s , C . J . , Mao , M . , 
Linsley,P.S. , He , Y . , Dai,H. and Hart, A. A.M. 

Diagnosis and Prognosis of Breast Cancer Patients 
Patent: JP 2005500832-A 2531 13-JAN-2005; 
Rosetta Inpharmatics 
COMMENT OS Homo sapiens 

PN JP 2005500832-A/2531 

PD 13-JAN-2005 

PF 14-JUN-2002 JP 2003505588 

PR 18-JUN-2001 US 60/298918, 14-MAY-2002 US 60/380710 PI 

rene bernards, marc j van de vi jver, laura johanna van ' t veer, PI 

Christopher j roberts,mao mao, peter s linsley , yudong he, PI 

hongyue dai, 

PI a a m hart 

CC 

FH Key Location/Qualifiers. 
FEATURES Location/Qualifiers 
source 1. .614 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref ="taxon : 96 06" 



Query Match 69.4% 
Best Local Similarity 100.0 
Matches 611; Conservative 



Score 611; DB 2; Length 614; 
; Pred. No. 2.2e-172; 
0; Mismatches 0; Indels 



Gaps 



Qy 



2 71 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 33 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
614 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 555 



Qy 



331 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 3 9 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
55 4 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 49 5 



Qy 
Db 
Qy 
Db 



3 91 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 45 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

49 4 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 43 5 

451 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 510 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

43 4 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 3 75 
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511 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 5 7 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3 7 4 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 315 

5 71 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 63 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
314 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 255 

631 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 69 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2 5 4 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 19 5 

6 91 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 75 0 

I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19 4 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 13 5 

751 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 810 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
13 4 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 75 

811 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 8 7 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 4 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 15 

871 TCCTCTGTAAC 881 
I I I I I I I I I I I 
14 TCCTCTGTAAC 4 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



DD417196 614 bp DNA linear PAT 26-MAR-2007 

Diagnosis and Prognosis of Breast Cancer Patients. 

DD417196 

DD417196.1 GI:134130622 
JP 2006519591-A/2531 . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorciata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 614) 
He , Y . and Veer,L.J.V. 

Diagnosis and Prognosis of Breast Cancer Patients 

Patent: JP 2006519591-A 2531 31-AUG-2006; 

Rosetta Inpharmatics LLC JUERGEN HERMANN NETT 

OS Homo sapiens 

PN JP 2006519591-A/2531 

PD 31-AUG-2006 

PF 15-JAN-2004 JP 2006500977 
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PR 15-JAN-2003 US 10/342887 

PI yudong he,laura johanna van ' t veer 

CC 

FH Key Location/Qualifiers. 
FEATURES Location/Qualifiers 
source 1. .614 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 



Query Match 6 9.4%; Score 611; DB 2; Length 614; 

Best Local Similarity 100.0%; Pred. No. 2.2e-172; 

Matches 611; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

y 2 71 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 33 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 614 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 555 

y 331 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 39 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 55 4 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 49 5 

y 391 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 450 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 494 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 435 

y 451 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 510 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 43 4 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 3 75 

y 511 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 570 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 3 74 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 315 

y 571 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 630 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 314 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 255 

y 631 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 690 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 254 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 195 

y 691 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 75 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 194 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 13 5 

y 751 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 810 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 13 4 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 75 
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Qy 811 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 8 70 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 4 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 15 

Qy 871 TCCTCTGTAAC 881 

I I I I I I I I I I I 
Db 14 TCCTCTGTAAC 4 



RESULT 9 

EA064706/C 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



FEATURES 

source 



EA064706 614 bp DNA linear PAT 07-FEB-2007 

Sequence 2531 from patent US 7171311. 

EA064706 

EA064706.1 GI : 125176216 

Unknown . 

Unknown . 

Unclassified . 

1 (bases 1 to 614) 

Dai,H., He, Y . , Linsley,P.S. , Mao , M . , Robert s , C . J . , Van ' t Veer, L.J. , 

Van cie Vijver,M.J., Bernards, R. and Hart, A. A.M. 

Methods of assigning treatment to breast cancer patients 

Patent: US 7171311-A 2531 30-JAN-2007; 

Rosetta Inpharmatics LLC and Netherlands Cancer Institute; Seattle, 

WA; 

US; 

Location /Qualifiers 
1. .614 

/organism= "unknown" 
/mol_type="genomic DNA" 



Query Match 6 9.4%; 

Best Local Similarity 100.0%; 
Matches 611; Conservative 



Score 611; DB 2; Length 614; 
Pred. No. 2.2e-172; 
0; Mismatches 0; Indels 



0; Gaps 



0; 



2 71 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 33 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
614 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 555 



Qy 

Db 

Qy 
Db 
Qy 



331 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 3 9 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

55 4 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 49 5 

3 91 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 45 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

49 4 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 43 5 

451 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 510 
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I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
43 4 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 3 75 

511 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 5 7 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3 7 4 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 315 

5 71 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 63 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
314 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 255 

631 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 69 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2 5 4 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 19 5 

6 91 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 75 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19 4 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 13 5 

751 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 810 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
13 4 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 75 

811 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 8 7 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 4 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 15 

871 TCCTCTGTAAC 881 
I I I I I I I I I I I 
14 TCCTCTGTAAC 4 



AC123908 185321 bp DNA linear PRI 28-MAR-2003 

Homo sapiens chromosome 16 clone CTD-2542L18, complete sequence. 
AC123908 

AC123908.3 GI : 29336199 
HTG . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

1 (bases 1 to 185321) 

DOE Joint Genome Institute, 
Alamos National Laboratory. 
Direct Submission 
Unpublished 

2 (bases 1 to 185321) 



Stanford Human Genome Center and Los 
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DOE Joint 
94598, USA 



DOE Joint 
94598, USA 



AUTHORS DOE Joint Genome Institute. 
TITLE Direct Submission 

JOURNAL Submitted ( 05-JUN-2002 ) Production Sequencing Facility, 
Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 
REFERENCE 3 (bases 1 to 185321) 

AUTHORS DOE Joint Genome Institute. 
TITLE Direct Submission 

JOURNAL Submitted ( 06-SEP-2002 ) Production Sequencing Facility, 
Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 
REFERENCE 4 (bases 1 to 185321) 

AUTHORS DOE Joint Genome Institute, Stanford Human Genome Center and Los 
Alamos National Laboratory. 

TITLE Direct Submission 

JOURNAL Submitted ( 2 8-MAR-2 0 03 ) DOE Joint Genome Institute, 2800 Mitchell 
Drive, Walnut Creek, CA 94598, USA 
COMMENT On Mar 28, 2003 this sequence version replaced gi: 22748376. 

Draft Sequence Produced by DOE Joint Genome Institute 
www . jgi . doe . gov 

Finishing Completed at Stanford Human Genome Center and Los Alamos 
National Laboratory 
www-shgc . Stanford . edu 

Quality: Phrap Quality >=40 99.9% of Sequence; 
Estimated Total Number of Errors is 0.1. 
FEATURES Location/Qualifiers 
source 1. .185321 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref ="taxon : 96 06" 
/chromosome=" 16 " 
/clone="CTD-2 5 42L18" 



Query Match 6 2.3%; Score 549; DB 5; Length 185321; 

Best Local Similarity 100.0%; Pred. No. 1.2e-153; 

Matches 549; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



333 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 3 9 2 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 4 9 73 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 124914 



Qy 393 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 452 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 124913 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 124854 

Qy 453 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 512 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 124853 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 124794 

Qy 513 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 5 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 124793 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 124734 
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5 73 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 63 2 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 4 733 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 12 46 7 4 

633 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 692 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 46 73 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 12 4614 

6 93 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 752 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 4613 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 12 455 4 

753 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 812 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 4553 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 12 449 4 

813 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 8 72 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 4 4 93 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 12 4 434 



873 CTCTGTAAC 881 
I I I I I I I I I 
124433 CTCTGTAAC 124425 



AC018695 211001 bp DNA linear PRI 09-JAN-2002 

Homo sapiens BAC clone RP11-568J23 from 16, complete sequence. 
AC018695 

AC018695.6 GI : 13443271 
HTG . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 211001) 

AUTHORS Sulston,J.E. and Waterston,R. 
TITLE Toward a complete human genome sequence 

JOURNAL Genome Res. 8 (11), 1097-1108 (1998) 
PUBMED 98470 74 
REFERENCE 2 (bases 1 to 211001) 

AUTHORS Haakenson, W. , Scott, K., Courtney, L., Drone, K., Gregory, S. and 
Lesley, K . 

TITLE The sequence of Homo sapiens BAC clone RP11-568J23 

JOURNAL Unpublished (2001) 
REFERENCE 3 (bases 1 to 211001) 
AUTHORS Waterston,R.H. 
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TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



Direct Submission 

Submitted ( 16-DEC-1999 ) Genome Sequencing Center, Washington 
University School of Medicine, 4444 Forest Park Parkway, St. L 
MO 63108, USA 

4 (bases 1 to 211001) 
Waterston,R.H. 
Direct Submission 

Submitted ( 2 4-MAR-2 0 0 1 ) Genome Sequencing Center, Washington 
University School of Medicine, 4444 Forest Park Parkway, St. L 
MO 63108, USA 

5 (bases 1 to 211001) 
Waterston, R. 

Direct Submission 

Submitted ( 0 7-NOV-2 0 0 1 ) Department of Genetics, Washington 
University, 4444 Forest Park Avenue, St. Louis, Missouri 63108 

6 (bases 1 to 211001) 
Waterston, R. 

Direct Submission 

Submitted ( 09-JAN-2002 ) Department of Genetics, Washington 
University, 4444 Forest Park Avenue, St. Louis, Missouri 63108 
On Mar 24, 2001 this sequence version replaced gi:9838280. 
Genome Center 

Center: Washington University Genome Sequencing Center 

Center code: WUGSC 

Web site: http://genome.wustl.edu/gsc 
Contact : sapiens @wat son. wust 1 .edu 

Summary Statistics 

Center project name: H_NH0568J23 



NOTICE: This sequence may not represent the entire insert of this 
clone. It may be shorter because we only sequence overlapping 
clone sections once, or longer because we provide a small overlap 
between neighboring data submissions. 

This sequence was finished as follows unless otherwise noted: 
all regions were double stranded, sequenced with an alternate 
chemistry, or covered by high quality data (i.e., phred quality >= 
30); an attempt was made to resolve all sequencing problems, such 
as compressions and repeats; all regions were covered by sequence 
from more than one subclone; and the assembly was confirmed by 
restriction digest. 

MAPPING INFORMATION: 

Mapping information for this clone was provided by Dr. John D. 
McPherson, Department of Genetics, Washington University, St. Louis 
MO. For additional information about the map position of this 
sequence, see http://genome.wustl.edu/gsc 

SOURCE INFORMATION: 

The RPCI-11 human BAC library was made from the blood of one male 
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donor, as described by Osoegawa,K., Woon,P.Y., Zhao,B., Frengen,E. , 
Tateno,M., Catanese, J . J . and de Jong, P.J. (1998) An improved 
approach for construction of bacterial artificial chromosome 
libraries. Genomics 51:1-8. The clone may be obtained either from 
Research Genetics, Inc. (http://www.resgen.com) or Pieter de Jong 
and coworkers at the Roswell Park Cancer Institute 
(http: //bacpac.med.buf falo.edu) 
VECTOR: pBACe3 . 6 

NEIGHBORING SEQUENCE INFORMATION: 

The clone seguenced to the right is RP11-542M13. Actual start of 
this clone is at base position 1 of RP11-568 J23 ; actual end is at 
base position 211001 of RP11-568J23. 

H_NH0568J23 contains an imperfect GA run from 143896 to 144293, in 
which the exact length is unknown. There are approximately 800 
bases missing according to the restriction digests and per data. 



FEATURES 

source 



repeat, 
repeat, 
repeat, 
repeat, 
repeat, 
repeat, 
repeat, 
repeat, 
repeat, 
repeat, 
repeat. 



H_NH0568J23 contains a single stranded region from 144079 to 144172 
which contains low guality data. 

Location /Qualifiers 

1. .211001 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref ="taxon : 9606" 
/chromosome=" 16 " 
/map="16" 

/clone="RPll-568J23" 
/clone_lib="RPCI-ll" 
234. .361 
/ rpt_f amily="Alu" 
407. .467 
/rpt_f amily="MIR" 
760. .835 
/rpt_f amily="MIR" 
891. .929 

/rpt_f amily="MERl_type" 
939. .1114 
/rpt_f amily="Ll " 
1115. .1254 
/rpt_f amily="Alu" 
1255. .1555 
/rpt_f amily="Alu" 
1530. .1555 
/rpt_f amily=" (A) n" 
1556. .1723 
/rpt_f amily="Alu" 
1724. .2020 
/rpt_f amily="Ll " 
2023. .2332 
/rpt_f amily="Alu" 



.region 



.region 



.region 



.region 



.region 



.region 



.region 



.region 



.region 



.region 



.region 



http://es/ScoreAccessWeb/GetItem.action?AppId=105295..J_122913_us-10-529-592a-l.rge&ItemType=4&startByte=0 (23 of 35)5/19/2009 9:51:03 AM 



SCORE Search Results Details for Application 10529592 and Search Remit 20090427_122913_us-10-529-592a-l.n 



repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

misc_f eature 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

misc_f eature 

misc_f eature 

misc_f eature 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 



2334. .2455 

/rpt_f amily="MERl_type" 

2672. .2776 

/rpt_f amily="Ll " 

2793. .3021 

/rpt_f amily="Ll " 

3022. .3340 

/ rpt_f amily="Alu" 

3024. .3051 

/rpt_f amily=" (T) n" 

3207. .3218 

/note="match to EST AA760709 (NID : g2809639 ) nzl3cll.sl" 

3341. .3612 

/rpt_f amily="Ll " 

3613. .3913 

/rpt_f amily="Alu" 

3914. .4009 

/rpt_f amily="Ll " 

4010. .4303 

/rpt_f amily="Alu" 

4291. .4349 

/rpt_f amily="AT_rich" 

4367. .4677 

/rpt_f amily="Alu" 

5175. .5260 

/rpt_f amily="MERl_type" 

5722. .5819 

/rpt_f amily="MIR" 

5916. .6022 

/rpt_f amily="L2 " 

6092. .6370 

/ rpt_f amily="Alu" 

6374. .6523 

/rpt_f amily="Alu" 

7188. .7617 

/note="match to EST AA666253 (NID : g2620866 ) ac42c06.sl" 
7234. .7630 

/note="match to EST R27299 (NID : g783 43 4 ) yh40dl2.sl" 
7464. .7775 

/note="match to EST T57377 (NID : g659238 ) yb51a03.rl" 

7609. .7690 

/rpt_f amily="MIR" 

7716. .7870 

/ rp t_f ami 1 y = "ERVL " 

7893. .7975 

/rpt_f amily="MERl_type" 

8057. .8158 

/ rp t_f ami 1 y = "ERVL " 

8334. .8424 

/rpt_f amily="MIR" 

8499. .8616 
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misc_f eature 

repeat_region 

misc_f eature 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 



/rpt 
9224. 
/note= 
9373. 
/rpt 
9455. 
/note= 
9577. 
/rpt 
9817 
/rpt 
9939. 
/rpt 
10041 
/rpt 
10173 
/rpt 
10309 
/rpt 
10590 
/rpt 
10612 
/rpt 
1118 
/rpt 
11236 



f amily="MIR" 
. 9229 

"match to EST BE883809 (NID : gl 033 2 5 8 5 ) ' 
. 9517 
f amily="MIR" 
.10166 

: "match to EST BG385747 (NID : gl32 78 9 0 1 ) ' 
. 9698 



f amily="MaLR" 

.9909 
f amily="MaLR" 

.10037 
f amily="MIR" 

10172 
f amily="Alu" 

10308 
f amily="Alu" 
. .10611 
f amily="Alu" 

10611 
f amily=" (A) n" 

10797 
f amily="Alu" 
. .11243 
f amily="MaLR" 
. .11344 



Query Match 62.3%; 
Best Local Similarity 100.0%, 
Matches 549; Conservative 



Score 549; DB 5; Length 211001; 
Pred. No. 1.2e-153; 
0; Mismatches 0; Indels 0; 



Gaps 



Qy 333 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 392 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1796 0 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 17901 



Qy 
Db 
Qy 
Db 
Qy 
Db 
Qy 
Db 



3 93 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 452 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

17900 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 17841 

453 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 512 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

17 8 4 0 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 17 7 81 

513 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 5 72 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

17 7 8 0 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 17 721 

5 73 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 63 2 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

17 720 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 17661 
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633 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 6 9 2 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
176 6 0 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 176 01 

6 93 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 752 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
17600 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 17541 

753 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 812 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
175 4 0 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 17 4 81 

813 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 8 72 
I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
17 4 8 0 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 17 421 

873 CTCTGTAAC 881 
I I I I I I I I I 
17420 CTCTGTAAC 17412 



AC009068 160356 bp DNA linear PRI 27-APR-2001 

Homo sapiens chromosome 16 clone RP11-314K3, complete sequence. 
AC009068 

AC009068.10 GI:13811892 
HTG . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 160356) 

AUTHORS DOE Joint Genome Institute and Stanford Human Genome Center. 
TITLE Direct Submission 

JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 160356) 

AUTHORS DOE Joint Genome Institute. 
TITLE Direct Submission 

JOURNAL Submitted ( 03-AUG-l 9 9 9 ) Production Sequencing Facility, DOE Joint 
Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA 
REFERENCE 3 (bases 1 to 160356) 

AUTHORS DOE Joint Genome Institute and Stanford Human Genome Center. 
TITLE Direct Submission 

JOURNAL Submitted ( 2 7-APR-2 0 0 1 ) DOE Joint Genome Institute, 2800 Mitchell 
Drive, Walnut Creek, CA 94598, USA 
COMMENT On Apr 27, 2001 this sequence version replaced gi: 12000278. 

FEATURES Location/Qualifiers 
source 1. .160356 
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/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref ="taxon : 9606" 
/chromosome=" 16 " 
/clone="RPll-314K3" 
misc_feature 1. .160356 

/note="Draft Sequence Produced by DOE Joint Genome 

Institute; 

www . jgi . doe . gov; 

Finishing Completed at Stanford Human Genome Center; 
www-shgc .Stanford . edu ; 

Quality: Phrap Quality >=40 99.6% of Sequence; 
Estimated Total Number of Errors is 0.3.; 
STS Content : ; 
WI-11796 G24048; 
SHGC-37143 G30481" 



Query Match 62.1%; Score 547.4; DB 5; Length 160356; 

Best Local Similarity 99.8%; Pred. No. 3.6e-153; 

Matches 548; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



333 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 3 9 2 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
9 810 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 79 751 



3 93 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 452 
I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 9 750 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 79691 



453 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 512 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 9 6 9 0 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 79 631 



513 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 5 72 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 9 63 0 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 79 5 71 



5 73 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 63 2 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
795 70 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 79511 



633 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 692 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 9 510 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 79 451 



6 93 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 752 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

7 9 450 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTCCC 79391 
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753 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 812 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 9390 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 79331 

813 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 8 72 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 933 0 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 792 71 

873 CTCTGTAAC 881 
I I I I I I I I I 
79270 CTCTGTAAC 79262 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



AC202720 189224 bp DNA linear HTG 29-MAY-2008 

Pan troglodytes chromosome 16 clone CH251-467D6, WORKING DRAFT 
SEQUENCE . 
AC202720 

AC202720.2 GI:189163682 

HTG; HTGS_PHASE1; HTGS_DRAFT; HTGS_ACTIVEFIN . 
Pan troglodytes (chimpanzee) 
Pan troglodytes 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Pan. 

1 (bases 1 to 189224) 
Wilson, R.K. 

Direct Submission 

Submitted ( 22-APR-2 0 0 7 ) Genetics, Genome Sequencing Center, 4444 
Forest Park Parkway, St. Louis, MO 63108, USA 

2 (bases 1 to 189224) 
Wilson, R.K. 

Direct Submission 

Submitted ( 2 9-MAY-2 0 0 8 ) Genetics 
Forest Park Parkway, St. Louis, 



Genome Sequencing Center, 
MO 63108, USA 



On May 29, 2008 this sequence version replaced gi : 145587712 . 



Genome Center 

Center: Washington University Genome Sequencing Center 

Center code: WUGSC 

Web site :http: //genome .wustl .edu 

Contact : submissions@watson. wustl .edu 

Project Information 

Center project name: C_AB0467D06 



Summary Statistics 

Sequencing vector: M13; 0% 
Sequencing vector: plasmid; 100% 
Chemistry: Dye-primer ET; 0% of reads 
Chemistry: Dye-terminator Big Dye; 100% of reads 
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Assembly program: 
Consensus quality: 
Consensus quality: 
Consensus quality: 



Phrap; version 0. 
188692 bases at 
188913 bases at 
189064 bases at 



990319 
least Q40 
least Q30 
least Q20 



FEATURES 

source 



misc_f eature 



NOTE: This is a 'working draft' sequence. It currently 
consists of 1 contigs. The true order of the pieces 
is not known and their order in this sequence record is 
arbitrary. Gaps between the contigs are represented as 
runs of N, but the exact sizes of the gaps are unknown. 
This record will be updated with the finished sequence 
as soon as it is available and the accession number will 
be preserved. 

1 189224: contig of 189224 bp in length. 

Location /Qualifiers 

1. .189224 

/organism="Pan troglodytes" 
/mol_type="genomic DNA" 
/db_xref="taxon: 9598" 
/chromosome=" 16 " 
/clone="CH2 51-46 7D6" 
1. .189224 

/note="assembly_name : Contig5 ; 
clone_end : SP6 ; 
vector_side : right" 



Query Match 61.6%; 
Best Local Similarity 99.3%; 
Matches 545; Conservative 



Score 542.6; DB 16; Length 189224; 
Pred. No. le-151; 
0; Mismatches 4; Indels 0; Gaps 



Qy 333 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 392 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 116593 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 116534 



Qy 3 93 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 452 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 116533 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 116474 

Qy 453 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 512 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 116 4 73 GGGGGTGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAATCC 116 414 

Qy 513 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 5 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 116 413 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCAAGGAATCCAGCCTGCCCACTTCCAG 116354 

Qy 573 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 632 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 116353 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 116294 
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633 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 6 9 2 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
116 2 93 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 116 23 4 

6 93 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 752 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
116 233 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 11617 4 

753 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 812 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
116173 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 116114 

813 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 8 72 
I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
116113 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 116 0 5 4 

873 CTCTGTAAC 881 
I I I I I I I I I 
116053 CTCTGTAAC 116045 



CS539581 447 bp DNA 

Sequence 168 from Patent W02 0 0 7 0 1 2 8 1 1 . 
CS539581 

CS539581.1 GI:145584734 



linear PAT 20-APR-2007 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Maitland,N. and Bryce,S. 
TITLE Stem cell markers 

JOURNAL Patent: WO 2007012811-A 168 01-FEB-2007; 
Procure Therapeutics Limited (GB) 
Location /Qualifiers 
1. .447 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 



Query Match 49.1% 
Best Local Similarity 100.0 
Matches 433; Conservative 



Score 433; DB 2; Length 447; 
Pred. No. 1.2e-118; 
0; Mismatches 0; Indels 



Gaps 



1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 
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I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

15 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 74 

6 1 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
75 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 13 4 

121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
13 5 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 19 4 

181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
195 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 25 4 

2 41 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 55 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 314 

3 01 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
315 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 3 7 4 

361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3 75 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 43 4 

421 CTCCTGCTCCAGC 433 

I I I I I I I I I I I I I 
435 CTCCTGCTCCAGC 447 



DNA linear HTG 14-APR-2007 

WORKING DRAFT SEQUENCE, 4 ordered 



AC198304 195676 bp 

Macaca mulatta clone CH250-65C9 
pieces . 
AC198304 

AC198304.4 GI:145279267 

HTG; HTGS_PHASE2; HTGS_DRAFT; HTGS_FULLTOP . 
Macaca mulatta (rhesus monkey) 
Macaca mulatta 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Cercopithecidae; Cercopithecinae ; Macaca. 
1 (bases 1 to 195676) 

Muzny , D . M . , Abraham, K . K . , Abulimiti, A. , Adams, C.Q., Aduba,G., 
Allen, C.C., Alsbrooks, S.L. , Anosike, U . S . , Archer, P.M., 
Arredondo, H . H . , Attaway,T., Bandaranaike, D . P . , Bangura,L., 
Barton, S.R., Bell, A. V., Bell, S.N. , Beraducci , A . R . , Bickham,C, 
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Biswalo,K., Blyth,P.R., Buhay,C.J., Canada, A., Cardenas , V . , 
Carter, K., Chacko,J., Chandrabose, M . N . , Chavez, A., Chavez, D., 
Chen,G., Chen,R., Chu,H., Clerc blankenburg, K . P . , Cockrell,R., 
Cooper, J. A., Coyle,M.D., Cree,A., Cueto,C.B., Curry, S.M., Dai,W., 
Dao , M . D . , Davila,M., Davis, C, Davy-Carroll , L . , Del fierro,P., 
Demen,R., Denson,S., Ding,Y., Dinh,H.H., Donlin,J.E., 
Dugan-Rocha, S . , Dunn, A.M., Durbin,K.J., Ebong,V.E., Egan,A., 
Espinosa, V. C . , Fa, M . , Fernandez , S . , Fernando, P . R . , Ferrer, A. R., 
Flagg,N., Forbes, L.D., Fowler, R.G., Fu,Q., Fuh, E . , Gabisi,R.A., 
Ganardhanan, M. , Ganer,J., Garcia iii,R.M., Garcia, A.M., 
Garcia, S.M., Garner, T.T., Ghose,S., Gingras,M., 

Gonzalez-Garay, M. L . , Guevara, W.V. , Haaland, W . C . , Haeberlen, K . A . , 
Hagans,B.J., Hall,0., Hamid,H., Hamilton, K . A . , Hampton, 0 . A . , 
Harbes,B.A., Harris, R. A., Havlak,P., Hawes,A.C, Hawkins , E . S . , 
Haynes,S.J., Hemphill, L., Hernandez , J . , Hines,S., Hirani,K., 
Hitchens,M.E. , Hodgson, A. V. , Hogues,M.E., Holder, M., Hollins,B., 
Howell, L.L., Hulyk,S.W., Hume, J., Jackson, A., Jackson, L . R . , 
Jacob, S.K., Jhangiani , S . N . , Jiang, H., Johnson, B., Johnson, R., 
Joshi,V., Joy,C, Kaikai,F.B., Kalaf us , K . J . , Kalu,J.B., Kang,Y., 
Keebler,J., Khan,Z.M., Kidwai,S., King,L.M., Kisamo,H., Kovar,C.L., 
Kowis,A.N., Kowis,C.R., Lago, L . A . , Lago, M . T . , Lai,C, Lara,F., 
Le , T . T . , Lee,S.L., Lee,T.W., Legall iii,F.H., Lemon, S.J. , 
Lewis, L.R., Li , B . , Li , Y . , Li,Z., Linnell , M . A . , Liu, J., Liu,W., 
Liu,Y., Liu,Y., Liyanage,D., London, P., Lopez, J., Lorensuhewa, L .M. , 
Lozado,R.J., Luc,T., Madu,R.C, Maheshwari , M . , Maheshwari , R . , 
Malloy,K., Mansouri, D.L. , Martinez, E., Mate jkova, P . , Mathew,T., 
Mccauley, S . K . , Mcpherson, J. D. , Mercado,C, Mercado, I . C . , 
Metzker, M. L . , Millin,A., Milosavl jevic, A. , Morgan, M.B., Morris, S., 
Munidasa,M., Murray, D.D., Muzny , D . M . , Nazareth, L . V . , Ngo,D.N., 
Nguyen, H.T., Nguyen, N.B., Nguyen, P. Q., Nwaokelemeh, 0 . 0 . , 
Obregon,M., Odeh,E.A., Okonkwo,F., Okwuonu, G . 0 . , Okwuonu, K . C . , 
Onyenekwe, J. , Parish, B. J., Parker, D.N. , Parra,A.A., Pasternak, S . , 
Patel,B.M., Patel,R.R., Paul, H. A., Perez, A., Perez, L.M., 
Perez, Y.Y., Pham,T.L., Player, E. J., Primus, E.L., Pu,L., Puazo,M., 
Purkiss,C, Qin,X., Quiroz,J.B., Rabata,D., Rachlin, E . K . , Ren,Y., 
Richards, S., Rojas,A., Ruiz,S., Sabo,A., Sant ibanez , J . , 
Savery,G.G., Scherer , S . E . , Schneider, B .W. , Sebasigari , R . , 
Sexton, M.M., Shen,H., Shen,Y., Sisson,I., Sneed,A.J., Sodergren, E . , 
Song,X., Sorelle, R.P. , Svatek,A.F., Taylor, E.W., Taylor, T.R., 
Thelus,R., Thomas, N., Thorn, R.D., Thornton, R . D . , Tong, M . Y . , 
Trejos, Z . Y. , Usmani,K., Vargo,C.E., Vattathil , S . , Vega, R. A., 
Villasana, D . , Volkov,A., Walker, D.L. , Wang,Q., Wang,S., 
Warren, J. T., Watt, J. E., Wei,X., Wheeler , D . A . , White, C.S., Williams 
jr,R.L., Williams, A. C . , Williams , G . A . , Williams , J . D . , Wilson, K., 
Woodworth, J.R. , Worley,K.C, Wright, R. A., Wu,J., Wu,W., Yakub,S., 
Yerrapragada, S . , Yu,F., Yuan, D . T . , Yuan,Y., Zhang, J., Zhang, L., 
Zhang, Z., Zhou, J., Zhu,Y., Weinstock,G. and Gibbs,R.A. 

TITLE Direct Submission 

JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 195676) 

AUTHORS Worley,K.C. 
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TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



Direct Submission 

Submitted ( 16-FEB-2 0 0 7 ) Human Genome Sequencing Center, Department 

of Molecular and Human Genetics, Baylor College of Medicine, One 

Baylor Plaza, Houston, TX 77030, USA 

3 (bases 1 to 195676) 

Worley,K.C. 

Direct Submission 

Submitted ( 1 4-APR-2 0 0 7 ) Human Genome Sequencing Center, Department 
of Molecular and Human Genetics, Baylor College of Medicine, One 
Baylor Plaza, Houston, TX 77030, USA 

On Apr 14, 2007 this sequence version replaced gi : 133 7543 72 . 
Genome Center 

Center: Baylor College of Medicine 

Center code: BCM 

Web site: http://www.hgsc.bcm.tmc.edu/ 

Contact: hgsc-help.tmc.edu 
Project Information 

Center project name: LEYX 

Center clone name: CH250-65C9 
Summary Statistics 

Sequencing vector: Plasmid; 

Chemistry: Dye-terminator Big Dye: 100% of reads 

Assembly program: Phrap; version 0.990329 

Consensus quality: 200531 bases at least Q40 

Consensus quality: 200802 bases at least Q30 

Consensus quality: 200989 bases at least Q20 

Estimated insert size: 206589; sum-of -cont igs estimation 

Quality coverage: 9x in Q20 bases; sum-of -cont igs estimation 



NOTE: Estimated insert size may differ from sequence length 

( see http : / /www . hgsc . bcm. tmc . edu/docs/Genbank_draf t_data . html ) 
The sequence data in this record represents an 'enhanced' version 
of a Phase 2 submission. The indicated order and orientation of 
each sequence has been established using one or more of the 
following: read-pair data from individual subclones, overlaps 
with neighboring clones, alignment with available reference 
sequence (e.g., human), and/or confirmation by PCR testing. 
NOTE: This is a 'working draft' sequence. It currently 
consists of 4 contigs. Gaps between the contigs 
are represented as runs of N. The order of the pieces 
is believed to be correct as given, however the sizes 
of the gaps between them are based on estimates that have 
provided by the submittor. 
This sequence will be replaced 

by the finished sequence as soon as it is available and 
the accession number will be preserved. 

1 142140: contig of 142140 bp in length 
142141 142840: gap of 700 bp 

142841 154919: contig of 12079 bp in length 
154920 155019: gap of 100 bp 
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FEATURES 

source 



misc_f eature 
misc_f eature 
gap 

misc_f eature 
misc_f eature 
gap 

misc_f eature 
misc_f eature 
gap 

misc_f eature 
ORIGIN 



5020 160424: contig of 5405 bp in length 
0425 161324: gap of 900 bp 

1325 195676: contig of 34352 bp in length. 
Location /Qualifiers 
1. .195676 

/organism="Macaca mulatta" 
/mol_type="genomic DNA" 
/db_xref ="taxon : 9544" 
/clone="CH2 5 0-65C9" 
1. .142140 

/ note="assembly_name : Contig5 8 " 
142141. .142840 
/ note="assembly_name : gap" 
142141. .142840 
/estimated_length=70 0 
142841. .154919 

/note="assembly_name : Contig42 " 
154920. .155019 
/note="assembly_name : gap" 
154920. .155019 
/estimated_length=10 0 
155020. .160424 

/note="assembly_name : Contig4 0 " 
160425. .161324 
/note="assembly_name :gap" 
160425. .161324 
/estimated_length=900 
161325. .195676 

/note="assembly_name : Contig5 9" 



Query Match 45.0%; Score 396.2; DB 16; Length 195676; 

Best Local Similarity 90.8%; Pred. No. 1.6e-107; 

Matches 433; Conservative 0; Mismatches 43; Indels 1; Gaps 



1; 



333 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 3 9 2 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3 6 0 45 AGTCTGGTTGGATGAGACAGGGTCGTGCCCGGATGATGGAGAACTCGACCCAGAAGCCTG 35 9 8 6 



Qy 393 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 452 

I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I 
Db 35985 AGGAGGTGTCGTGGATTTGTCTGGTGGGCTCCTGCTGCGGGGTCCCAGCTTCAGGTGTCC 35926 

Qy 453 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 512 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 35925 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCTTGGATGGGAACTGAGCGAATCT 35866 

Qy 513 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 5 72 

III I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 35865 GGGTCTCCGCTGGGAGAGATGTGGCAGGACCAGCGAGGAATCCAGCCTGCCTGCTTCCGG 35806 
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5 73 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 63 2 

I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
358 05 AACAGTGTTTCCC-GACCCTGCTGAGTGGACTGGACCTCTGACACCTCCAGGGTCTTGCT 35 74 7 

633 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 692 
I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
35 746 GACTCCAGCCTGGTGCAAGGGAGCTCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 356 8 7 

6 93 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 752 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
3 5686 TCTCTTCAGGACAAACACACCTTCCCAGCCCCACGGGCTGTGCAAATACGTGCCCCTGCC 35627 

753 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTT 809 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I 
3 5626 ATAAGCACAAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTTTTTTTTTTTTT 35570 
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